A novel projection-based likelihood measure for noisy speech recognition

Authors

Jen-Tzung Chien

Hsiao-Chuan Wang

Lee-Min Lee

Abstract

The projection-based likelihood measure, an effective means of reducing noise contamination in speech recognition, dynamically searches for an optimal equalization factor that adapts the cepstral mean vector of a hidden Markov model (HMM) to equalize the noisy observation. In this paper, we present a novel likelihood measure which extends the adaptation mechanism to the shrinkage of the covariance matrix and the adaptation bias of the mean vector. A set of adaptation functions is proposed for obtaining the compensation factors. Experiments indicate that the proposed likelihood measure can markedly improve recognition accuracy. © 1998 Elsevier Science B.V. All rights reserved.
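The baseline idea the abstract builds on can be illustrated with a minimal sketch. It assumes a single Gaussian state with a diagonal covariance and uses the closed-form scale factor that maximizes the Gaussian likelihood of an observation x under a scaled mean, λ = (xᵀΣ⁻¹μ) / (μᵀΣ⁻¹μ). The function names and example numbers are hypothetical, and the sketch does not reproduce the paper's covariance shrinkage, bias adaptation, or adaptation functions, which the abstract does not specify.

```python
import math
from typing import Sequence, Tuple


def gaussian_log_likelihood(x: Sequence[float], mean: Sequence[float],
                            var: Sequence[float]) -> float:
    """Log-density of x under a Gaussian with diagonal covariance `var`."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def optimal_equalization_factor(x: Sequence[float], mean: Sequence[float],
                                var: Sequence[float]) -> float:
    """Scale factor on the mean that maximizes the Gaussian likelihood of x.

    Setting the derivative of (x - lam*mean)^T S^-1 (x - lam*mean) with
    respect to lam to zero gives lam = (x^T S^-1 mean) / (mean^T S^-1 mean).
    """
    num = sum(xi * m / v for xi, m, v in zip(x, mean, var))
    den = sum(m * m / v for m, v in zip(mean, var))
    if den == 0.0:
        raise ValueError("mean vector must be non-zero")
    return num / den


def projection_log_likelihood(x: Sequence[float], mean: Sequence[float],
                              var: Sequence[float]) -> Tuple[float, float]:
    """Score x against the mean after scaling it by the optimal factor."""
    lam = optimal_equalization_factor(x, mean, var)
    scaled_mean = [lam * m for m in mean]
    return gaussian_log_likelihood(x, scaled_mean, var), lam


# Illustrative values only: the observation is a scaled-down copy of the mean.
mean = [2.0, -1.0, 0.5]
var = [1.0, 2.0, 0.5]
x = [1.0, -0.5, 0.25]

plain = gaussian_log_likelihood(x, mean, var)
projected, lam = projection_log_likelihood(x, mean, var)
print(f"lambda = {lam:.3f}, plain = {plain:.3f}, projected = {projected:.3f}")
```

Because λ = 1 is always among the candidate scale factors, the projected score can never be lower than the plain Gaussian score for the same state. In an HMM recognizer this scoring would be applied per state and per frame, which this sketch leaves out.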

Similar resources

Speech recognition in noisy environment using weighted projection-based likelihood measure

This paper investigates a projection-based likelihood measure that improves speech recognition performance in noisy environments. The projection-based likelihood measure is modified to provide a weighting and projection effect and to reduce computational complexity. It is evaluated in sub-model based word recognition using a semi-continuous hidden Markov model in speaker-independent mode. Experiment...

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of emotional states from speech in noisy conditions has become an important research topic in emotional speech recognition in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

IMPROVED HMM ENTROPY FOR ROBUST SUB−BAND SPEECH RECOGNITION (ThuPmOR1)

In recent years, sub−band speech recognition has been found useful in robust speech recognition, especially for speech signals contaminated by band−limited noise. In sub−band speech recognition, full band speech is divided into several frequency sub−bands and then sub−band feature vectors or their generated likelihoods by corresponding sub−band recognizers are combined to give the result of rec...

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

Convolutional Neural Networks (CNNs) have proven effective in speech recognition systems, both for feature extraction and for acoustic modeling. In addition, CNNs have been used for robust speech recognition, and competitive results have been reported. A Convolutive Bottleneck Network (CBN) is a kind of CNN which has a bottleneck layer among its fully connected layers. The bottleneck fea...

A new method for robust speech recognition based on missing data using a bidirectional neural network

The performance of speech recognition systems is greatly reduced when speech is corrupted by noise. One common approach to robust speech recognition is the family of missing feature methods. In these methods, the components of the time-frequency representation of the signal (the spectrogram) that have a low signal-to-noise ratio (SNR) are tagged as missing and deleted, then replaced using the remaining components and statistical ...

Journal:

Speech Communication

Volume 24

Publication date: 1998
